Web Usage Mining Using Support Vector Machine

نویسنده

  • Sung-Hae Jun
چکیده

The web contains rich and dynamic collections of hyperlink information, web page access, and usage information providing rich sources for data mining. From this, we need a system to recommend a visitor good information. This recommendation system can be constructed by web usage mining process. The web usage mining mines web log records to discover user access patterns of web pages. Also it is the application of data mining techniques to large web log data in order to extract usage patterns from user’s click streams. In general, the size of web log records is so large that we have difficulty to analyze web log data. To make matter worse, the web log records are very sparse. So it is very hard to estimate the dependency between the web pages. In this paper, we solved this difficulty of web usage mining using support vector machine. In the experiments, we verified our proposed method by given data from UCI machine learning repository and KDD cup 2000.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Support Vector Clustering for Web Usage Mining

This paper applies the use of support vector clustering (SVC) in the domain of web usage mining. In this method, the data points are transformed to a high dimensional space called the feature space, where support vectors are used to define a smallest sphere enclosing the data. A soft-margin constant is used to handle outliers. The paper then performs experiments to compare SVC and the K-Means a...

متن کامل

Mining Biological Repetitive Sequences Using Support Vector Machines and Fuzzy SVM

Structural repetitive subsequences are most important portion of biological sequences, which play crucial roles on corresponding sequence’s fold and functionality. Biggest class of the repetitive subsequences is “Transposable Elements” which has its own sub-classes upon contexts’ structures. Many researches have been performed to criticality determine the structure and function of repetitiv...

متن کامل

High performance of the support vector machine in classifying hyperspectral data using a limited dataset

To prospect mineral deposits at regional scale, recognition and classification of hydrothermal alteration zones using remote sensing data is a popular strategy. Due to the large number of spectral bands, classification of the hyperspectral data may be negatively affected by the Hughes phenomenon. A practical way to handle the Hughes problem is preparing a lot of training samples until the size ...

متن کامل

Sentiment Analisis on Web-based Reviews using Data Mining and Support Vector Machine

This work aims to use sentiment analysis techniques, data mining, text mining and natural language processing to indicate the polarity of texts using support vector machine. Weka software and a movie review database from Internet Movie Database IMDb were used. This work uses preprocessing filters and WRAPPER techniques and Support Vector Machine (SVM) for classification. It presents better resu...

متن کامل

Prognosis of multiple sclerosis disease using data mining approaches random forest and support vector machine based on genetic algorithm

Background: Multiple sclerosis (MS) is a degenerative inflammatory disease which is most commonly diagnosed by magnetic resonance imaging (MRI). But, since the MRI device uses of a magnetic field, if there are metal objects in the patient's body, it can disrupt the health of the patient, the functioning of the MRI, and distortion in the images. Due to limitations of using MRI device, screening ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005